Multilingual text-to-phoneme mapping
نویسندگان
چکیده
This paper introduces a novel approach for generating multilingual text-to-phoneme mappings for use in multilingual speech recognition systems. The multilingual mappings are based on the weighted outputs from a neural network text-to-phoneme model, trained on data mixed from several languages. The multilingual mappings used together with a branched grammar decoding scheme is able to capture both interand intra-language pronunciation variations which is ideal for multilingual speaker independent speech recognition systems. A significant improvement in overall system performance was obtained for a multilingual speaker independent name dialing task when applying multilingual instead of language dependent text-to-phoneme mapping.
منابع مشابه
Introduction to multilingual corpus-based concatenative speech synthesis
This tutorial paper addresses foreign-language support in corpus-based concatenative text-to-speech systems. We give an overview of application domains where strictly monolingual speech synthesis is not sufficient and where multilingual text-to-speech is required or highly desirable. We describe two approaches to multilingual corpus-based speech synthesis: phoneme mapping on the one hand, and t...
متن کاملA Hybrid Approach to Bilingual Text-To-Phoneme Mapping
In this paper, we address the problem of bilingual text-to-phoneme (TTP) mapping in which the phonetic transcription of isolated written words must be found. In general, in the bilingual/multilingual TTP mapping for isolated words, two processing steps are applied to each input word. The language of each word is first identified and then the letters of the word are translated into their phoneti...
متن کاملCross-lingual phoneme mapping for multilingual synthesis systems
Development of a multilingual text-to-speech (TTS) system usually requires a lot of time, effort and language resources. Furthermore, the implementation tends to consume large amounts of memory as the number of supported languages increases. This paper proposes a simple method for quickly increasing the language portfolio of an existing TTS system with the minimal effort and memory consumption....
متن کاملCross-language Transfer of Multilingual Phoneme Models
We present a method to use speech data from multiple languages to enhance the performance of a flexible vocabulary command word recognizer which is trained using a small amount of speech data of the target language. We develop data-driven approaches for identification of multilingual phoneme units and mapping of these units to the target language phonemes, and evaluate them against the knowledg...
متن کاملPhoneme lattice based texttiling towards multilingual story segmentation
This paper proposes a phoneme lattice based TextTiling approach towards multilingual story segmentation. The phoneme is the smallest segmental unit in a language and the number of phonemes in a language is usually far smaller than the number of words. Furthermore, many phonemes are shared by different languages. These properties make phonemes particularly appropriate for representing multilingu...
متن کامل